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We prove that orthogonal constructor term rewrite systems and lambda-calculus with weak 
(i.e., no reduction is allowed under the scope of a lambda-abstraction) call- by- value reduc- 
tion can simulate each other with a linear overhead. In particular, weak call-by-value beta- 
reduction can be simulated by an orthogonal constructor term rewrite system in the same 
number of reduction steps. Conversely, each reduction in an term rewrite system can be 
simulated by a constant number of beta-reduction steps. This is relevant to implicit compu- 
■ tational complexity, because the number of beta steps to normal form is polynomially related 

to the actual cost (that is, as performed on a Turing machine) of normalization, under weak 
call-by-value reduction. Orthogonal constructor term rewrite systems and lambda-calculus 
are thus both polynomially related to Turing machines, taking as notion of cost their natural 
parameters. 



CZ2 

lHj| 1 Motivations 

Implicit computational complexity is a young research area, whose main aim is the description of 



complexity phenomena based on language restrictions, and not on external measure conditions or 
on explicit machine models. It borrows techniques and results from mathematical logic (model 
theory, recursion theory, and proof theory) and in doing so it has allowed the incorporation of 
aspects of computational complexity into areas such as formal methods in software development 
and programming language design. The most developed area of implicit computational complexity 
is probably the model theoretic one - finite model theory being a very successful way to describe 
0^ . complexity classes. In the design of programming language tools (e.g., type systems), however, 

syntactical techniques prove more useful. In the last years we have seen much work restrict- 
, ing recursive schemata and developing general proof theoretical techniques to enforce resource 

bounds on programs. Important achievements have been the characterizations of several complex- 
ity classes by means of limitations of recursive definitions (e.g., [3J [TU]) and, more recently, by 
using the "light" fragments of linear logic [?]. Moreover, rewriting techniques such as recursive 
path orderings and the interpretation method have recently been proved useful in the field 
By borrowing the terminology from software design technology, we may dub this area as implicit 
computational complexity in the large, aiming at a broad, global view on complexity classes. We 
may have also an implicit computational complexity in the small — using logic to study single 
machine-free models of computation. Indeed, many models of computations do not come with 
a natural cost model — a definition of cost which is both intrinsically rooted in the model of 
computation, and, at the same time, it is polynomially related to the cost of implementing that 
model of computation on a standard Turing machine. The main example is the A-calculus: The 
most natural intrinsic parameter of a computation is its number of beta-reductions, but this very 
parameter bears no relation, in general, with the actual cost of performing that computation, since 
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a beta-reduction may involve the duplication of arbitrarily big subterm^]- What we call implicit 
computational complexity in the small, therefore, gives complexity significance to notions and 
results for computation models where such natural cost measures do not exist, or are not obvious. 
In particular, it looks for cost-explicit simulations between such computational models. 

The present paper applies this viewpoint to the relation between A-calculus and orthogonal 
(constructor) term rewrite systems. We will prove that these two machine models simulate each 
other with a linear overhead. That each constructor term rewrite system could be simulated by 
A-terms and beta-reduction is well known, in view of the availability, in A-calculus, of fixed-point 
operators, which may be used to solve the mutual recursion expressed by first-order rewrite rules. 
Here (Section U) we make explicit the complexity content of this simulation, by showing that any 
first-order rewriting of n steps can be simulated by kn beta steps, where k depends on the specific 
rewrite system but not on the size of the involved terms. Crucial to this result is the encoding of 
constructor terms using Scott's schema for numerals [19]. Indeed, Parigot [12] (see also [13]) shows 
that in the pure A-calculus Church numerals do not admit a predecessor working in a constant 
number of beta steps. Moreover, Splawski and Urzyczyn [17] show that it is unlikely that our 
encoding could work in the typed context of System F. 

Section [3] studies the converse - the simulation of (weak) A-calculus reduction by means of 
orthogonal constructor term rewrite systems. We give an encoding of A-terms into a (first-order) 
constructor term rewrite system. We write [•]$ for the map returning a first-order term, given a A- 
tcrm; [Af]$ is, in a sense, a complete dcfunctionalization of the A-term M, where any A-abstraction 
is represented by an atomic constructor. This is similar, although not technically the same, to 
the use of supercombinators (e.g., [H]). We show that A-reduction is simulated step by step by 
first-order rewriting (Theorem [T]) . 

As a consequence, taking the number of beta steps as a cost model for weak A-calculus is 
equivalent (up to a linear function) to taking the number of rewritings in orthogonal constructor 
term rewrite systems. This is relevant to implicit computational complexity "in the small" , because 
the number of beta steps to normal form is polynomially related to the actual cost (that is, 
as performed on a Turing machine) of normalization, under weak call-by-value reduction. This 
has been established by Sands, Gustavsson, and Moran [16] , by a fine analysis of a A-calculus 
implementation based on a stack machine. Constructor term rewrite systems and A-calculus are 
thus both reasonable machines (see the "invariance thesis" in [H]), taking as notion of cost their 
natural, intrinsic parameters. 

As a byproduct, in Section [5] we sketch a different proof of the cited result in [15] . Instead of 
using a stack machine, we show how we could encode constructor term rewriting in term graph 
rewriting. In term graph rewriting we avoid the explicit duplication and substitution inherent 
to rewriting (and thus also to beta-reduction) and, moreover, we exploit the possible sharing 
of subterms. A more in-depth study of the complexity of (constructor) graph rewriting and its 
relations with (constructor) term rewriting can be found in our [5]. 

In Section^] we show how to obtain the same results of the previous sections when call-by-name 
replaces call-by-value as the underlying strategy in the lambda-calculus. 

This paper is an extended version of the one with the same title appeared in the proceedings of 
ICALP 2009 6]. Besides including full proofs, it has an extended Section [5] and the new material 
of Section \§\ 

2 Preliminaries 

The language we study is the pure untyped A-calculus endowed with weak (that is, we never reduce 
under an abstraction) call-by-value reduction. 

Definition 1 The following definitions are standard: 

1 In full beta-reduction, the size of the duplicated term is indeed arbitrary and does not depend on the size of 
the original term the reduction started from. The situation is much different with weak reduction, as we will see. 
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• Terms are defined as follows: 

M ::= x | Xx.M \ MM, 

where x ranges a denwmerable set T. A denotes the set of all X-terms. We assume the existence 
of a fixed, total, order on T; this way FV(M) will be a sequence (without repetitions) of variables, 
not a set. A term M is said to be closed ifFV(M) = e, where e is the empty sequence. 

• Values are defined as follows: 

V ::= x | Xx.M. 

• Weak call-by-value reduction is denoted by —> v and is obtained by closing call-by-value reduction 
under any applicative context: 

M — >t, N M N 

(Xx.M)V ->„ M{V/x} ML -+ v NL LM ->„ LN 

Here M ranges over terms, while V ranges over values. 

• The length \M\ of M is defined as follows, by induction on M : \x\ = I, \Xx.M\ = \M\ + 1 and 
\MN\ = \M\ + \N\ + 1. 

Weak call-by-value reduction enjoys many nice properties. In particular, the one-step diamond 
property holds and, as a consequence, the number of beta steps to normal form (if any) is invariant 
on the reduction order [2] (this justifies the way we defined reduction, which is slightly more general 
than Plotkin's one [E]). It is then meaningful to define Time v (M) as the number of beta steps to 
normal form (or uj if such a normal form does not exist). This cost model will be referred to as the 
unitary cost model, since each beta (weak call-by- value) reduction step counts for 1 in the global 
cost of normalization. Moreover, notice that a-conversion is not needed during reduction of closed 
terms: if M — N and M is closed, then the reduced redex will be in the form (Xx.L)V, where 
V is a closed value. As a consequence, arguments are always closed and open variables cannot be 
captured. 

The following lemma gives us a generalization of the fixed-point (call-by-value) combinator 
(but observe the explicit limit k on the reduction length, in the spirit of implicit computational 
complexity in the small): 

Lemma 1 For every natural number n, there are terms Hi , . . . , H n and a natural number m such 
that for any sequence of values Vj., . . . , V n and for any 1 < i < n: 

HiVi ...V n ->5 Vi{Xx.H x Vi . . . V n x) . . . {Xx.H n V x . . . V n x), 

where k < m. 

Proof. The terms we are looking for are simply the following: 

Hi = Mi Mi ...M n 

where, for every 1 < j < n, 

Mj = Xxi Xx n .Xyi y n .yj(Xz.x\X X . . .x„yi . ..y n z) . . . (Xz.x n x\ . . .x n yi . ..y n z). 

The natural number m is simply In. □ 

We will consider in this paper orthogonal constructor (term) rewrite systems (CRS, see [2]). 
A constructor (term) rewrite system is a pair S = (Se,7£e) where: 

• Symbols in the signature Eh can be either constructors or function symbols, each with its 
arity. 

• Terms in C(S) are those built from constructors and are called constructor terms. 

• Terms in 7^(2, T) are those built from constructors and variables and are called patterns. 

• Terms in T(S) are those built from constructor and function symbols and are called closed 
terms. 
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• Terms in V(S, T) are those built from constructors, functions symbols and variables in T 
and are dubbed terms. 

• Rules in IZz are in the form f(pi, . . . , p„) — t where f is a function symbol, pi, . . . ,p„ G 
"P(S, T) and t G V(S, T). We here consider orthogonal rewrite systems only, i.e. we assume 
that no distinct two rules in are overlapping and that every variable appears at most 
once in the lhs of any rule in IZ3. Moreover, we assume that reduction is call- by- value, i.e. 
the substitution triggering any reduction must assign constructor terms to variables. This 
restriction is anyway natural in constructor rewriting. 

For any term t in a CRS, \t\ denotes the number of symbol occurrences, while \t\f denotes the 
number of occurrences of the symbol f in t. 

3 From Lambda- Calculus to Constructor Term Rewriting 

Definition 2 (The CRS $) The constructor rewrite system $ is defined as a set of rules 1Z& 
over an infinite signature S$ . In particular: 

• The signature £$ includes the binary function symbol app and constructor symbols c x .u for 
every M G A and every x G T. The arity of c Xi m is the length of FV(Xx.M). To every term 
M G A we can associate a term [M]$ G V($, T) as follows: 

[x]$ = x; 

[Xx.M]^ = c Xi m(xi, ■ ■ ■ ,x n ), where FV(Xx.M) = x\, ... ,x n ; 
[MNU = app([M]*, [JV]*). 

Observe that if M is closed, then [M]$ G T($). 

• The rewrite rules in 7?.$ are all the rules in the following form: 

app(c x> M(xi, • ■ • ,x n ),x) -> [M]$, 

where FV(Xx.M) = X\, . . . , x n . 

• A term t G T($) is canonical if either t G C(<&) or t = app(u, v) where u and v are themselves 
canonical. 

Notice that the signature £$ contains an infinite amount of constructors. 

Example 1 Consider the X-term M = (Xx.xx)(Xy.yy). [M]$ is t = app(c XiXa; , c yiyy ). Moreover, 
t — > a.pp(c yi yy, Cy iVy ) = u, as expected. Finally, we have u — > u. 

To any term in V(<£, T) corresponds a A-term in A: 

Definition 3 To every term f £ V($,T) we can associate a term (t)\ G A as follows: 

(x)a = x 
(app(u, v)) A = (u) A (v) A 

(Cx,m(*1, • • -*n)>A = {Xx.M){(t 1 ) A /xi, . . . , (t n ) A /x n } 

where FV(Ax.M) = x\, . . . , x n . 

Canonicity holds for terms in $ obtained as images of (closed) A-tcrms via [•]$. Moreover, canon- 
icity is preserved by reduction in $: 

Lemma 2 For every closed M G A 7 [M]$ is canonical. Moreover, if t is canonical and t-t«, 
then u is canonical. 

Proof. [M]<j, is canonical for any M G A by induction on the structure of M (which, by hypothesis, 
is either an abstraction or an application NL where both N and L are closed). We can further 
prove that v — [M]$,{t\/x\, . . . t n /x n } is canonical whenever ti,...,t n G C($) and x\,...,x n 
includes all the variables in FV(M): 
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If M = Xi, then v = U, which is clearly canonical. 
If M = NL, then 



v = [NL]a,{t 1 /xi,...t n /x n } 

= app ([iV]*{*i/a;i, . . . t n /x n }, [L}^ > {t 1 /x 1 , . . . t n /x n }) 

which is canonical, by IH. 

• If M = Xy.N, then 

v = [\y.N]n,{ti/xi,...t n /x n } 

Cy : N (^ii ; ■ ■ ■ ) ){^1 /^l 5 • • • In I %n\ 
*~y,N (tii i • ■ ■ i ^im ) 

which is canonical, because each is in £(<&). 
This implies the rhs of any instance of a rule in 1Z& is canonical. As a consequence, u is canonical 
whenever t u and t is canonical. This concludes the proof. □ 

For canonical terms, being a normal form is equivalent of being mapped to a normal form via (-)a- 
This is not true, in general: take as a counterexample c Xi!/ (app(c2 ; 2, c ZiZ )), which corresponds to 
Xx.(Xz.z)(Xz.z) via (-)a- 

Lemma 3 A canonical term t is a normal form iff (£)a is a normal form. 

Proof. If a canonical t is a normal form, then t does not contain the function symbol app and, as a 
consequence, {t)\ is an abstraction, which is always a normal form. Conversely, if (£)a is a normal 
form, then t is not in the form app(u, v), because otherwise (t)\ will be a (closed) application, 
which cannot be a normal form. But since t is canonical, t £ C($), which only contains terms in 
normal form. □ 

The following substitution lemma will be useful later. 

Lemma 4 For every term t e V($, T) and every t\, . . . , t n £ C($), 

(tih/X!, t n /x n }) A = (*)a{(*i}a/2:i, • • ■ , {t n )A/x n } 

whenever x\, . . . ,x n includes all the variables in t. 

Proof. By induction on t: 

• Ift = Xi, then 

(t{ti/xi,...,t n /x n }) A = (Xi{ti/Xi,...,t n /X„})A 

= (U)a 

= X i {(t- L )A/x 1 ,...,(t n ) A /x n } 

= t{{t 1 ) A /x 1 ,...,{t n ) A /x n }. 



If t = app(w, v), then 



(t{ti/xi, . . . ,t n /x n }) A = (app(u,v){t 1 /x 1 ,...,t n /x n }) A 

= (app(u{ii/xi, . . . , t n /x n }, v{ti/x!, t n /x n })) A 

= (u{h/xi, t n /x n }) A (v{ti/xi, . . . , t n /x n }) A 

= {u) A {(h) A /xi, (t n ) A /x n }(v) A {(ti) A /xi, (t n ) A /x n } 

= (u) A (v) A {(ti) A /xi, . . . , (t n ) A /x n } 

= (app(u, u))A{(*i)A/ari, . . . , (t n ) A /x n } 

= (t) A {(ti) A /xi,...,(t n ) A /x n }. 
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• If t = c Vt pf(ui, . . . ,u m ), then 



{t{h/xi, . . . , t n /x n })\ = (cy tN (ui, . . . , u m ){ti/xi, . . . , t n /x n })A 

= {Cy,N{u\{ti/xi, . . . , t n /x n }, . . . , U m {ti/Xi, . . . , t n /x n })) A 

= (A2/JV){(ui{ii/xi, . . . ,t n /x n }) A /x il 

, ... 5 

{tt/x!, t n /x n }) A /x im } 

= (Xy.N){(ui) A {(ti) A /xi, (t n ) A /x n }/x ll 

, . . . j 

(u m ) A {(tl) A /xi, (t n ) A /x n }/x im } 

= ((\y.N){(ui) A /xi, . . . ,u m /x ll }){(t 1 ) A /xi, . . . , {t n ) A /x n } 
= (c v ,n(ui,- ■ ■ , u m )) A {(ti) A /xi, . . . , (t n ) A /x n } 
= (t) A {(ti} A /x 1 ,...,(t n } A /x n }. 

This concludes the proof. □ 

Lemma 5 For every X-term M 6 A, ([M]$)a = M. 

Proof. By induction on M: 

• If M = x, then 

([M]$)a = <N*)a = (^)a = x. 

• If M = NL, then 

<[M]*) A = <app([A]„, [i]*)) A = <[A] $ } a <[L]$}a - AL. 

• If M = Ay.AT, then 

([M]$) A = (cj, ; jv(a;i, • • • , x n )) A = (Xy.N){xi/x±,. . . , x„/x n } = Ay.N = M. 

This concludes the proof. □ 

The previous two lemmas implies that if M £ A, t\, . . . , t n € C($) and xi, . . . , x n includes all the 
variables in FV(M), then: 

([M]${ii/xi, . . . ,t n /x n }) A = M{(ti) A /xi, (t n ) A /x„}. (1) 

Reduction in $ can be simulated by reduction in the A-calculus, provided the starting term is 
canonical. 

Lemma 6 If t is canonical and t — > u, then (t) A — »•„ (u) A . 

Proof. Consider the (instance of the) rewriting rule which turns t into u. Let it be 
app(cj,,M(*i) ■ ■ - ,tn),v) [Af]${ii/xi, . . . ,t n /x n ,v/y}. 

Clearly, 

(app(c y ,M(*i, ■ ■ ■ ,tn),v)) A = ((Xy.M){ti/xi, . . . , t n /x n })(v) A 

while, by Jl}: 

([M]${ti/xi, . . .,t n /x n ,v/y}) A = M{{t-C) A /xi, . . . , (t n ) A /x ni (v) A /y} 
which implies the thesis. □ 
Conversely, call-by- value reduction in the A-calculus can be simulated in <f>: 
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Lemma 7 If M — >„ TV, t is canonical and {t}\ — M, then t — ► u, where (u)a = N. 

Proof. Let (Xx.L)V be the redex fired in M when rewriting it to N. There must be a corre- 
sponding subterm v of t such that (v)\ = (\x.L)V . Then 

u = app(c X) p(ti, . . . ,t n ),w), 

where (c x ,p{ti, . . . ,t n ))\ = Xx.L. and (w}\ = V. Observe that, by definition, 

(Cx,p(tl, • ■ ■ ,*ti))a = (\x.P){(t 1 ) A /xi, (t n ) A /x n } 

where FV(P) = Xi, . . . , x n . Since t is canonical, ti, . . . ,t n £ C($). Moreover, since V is a value, w 
itself is in C(<I>).This implies 

app(c ;Ei p(ii, . . .,t n ),w) -> [P]${ti/xi, . . . ,t n /x n ,w/x}. 

By ©: 

([P]${tl/xi,...,t„/x„,^/x})A = P{(tl)A/xi,...,(t n )A/x n ,(w) A /x} 

= (P{(tl)\/Xl, . . . , (tn}A/a:n}){(w)A/a;} 

= (Ax.L){V/x}. 

This concludes the proof. □ 

The previous lemmas altogether imply the following theorem, by which A-calculus normalization 
can be mimicked (step-by-step) by reduction in $: 

Theorem 1 (Term Reducibility) Let M 6 A be a closed term. The following two conditions 
are equivalent: 

1. M — >™ N where N is in normal form; 

2. [M]$ — t where (t) A = N and t is in normal form. 

Proof. Suppose M — »™ N, where N is in normal form. Then, by applying Lemma [71 we obtain 
a term t such that [M]$ t and = N. By LemmaEJ t is canonical and, by Lemma[3j it is 
in normal form. Now, suppose [M]$ — > n t where (t) A = N and t is in normal form. By applying 
n times Lemma [HI we obtain ([M]$)a — >" (t}\ = N. But ([M]$)a = M by Lemma [5] and N is a 
normal form by Lemma [3J since [M] $ and t are canonical by Lemma [5] □ 

There is another nice property of that will be crucial in proving the main result of this paper: 

Proposition 1 For every M £ A, for every t with — >* t and for every occurrence of a 
constructor c^jy in t, N is a subterm of M . 

Proof. Assume [M]$ — t and proceed by induction on n. □ 

Example 2 Let us consider the X-term M = (Xx.(Xy.x)x)(Xz.z). Notice that 

M — ►„ (Xy.(Xz.z))(Xz.z) — ►„ Xz.z. 

Clearly [M]« = appfcj-^j.^^c^,). Moreover: 

app(c a , j (Aj,. a .) a! , c Z)Z ) -> app(c y ^(c ZiZ ),c Zi2 ) -* c z>z . 

For every constructor c Wt jf occurring in any term in the previous reduction sequence, N is a 
subterm of M . 
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A remark on <I> is now in order. <I> is an infinite CRS, since contains an infinite amount 
of constructor symbols and, moreover, there are infinitely many rules in 1Z$. As a consequence, 
what we have presented here is an embedding of the (weak, call-by-value) A-calculus into an 
infinite (orthogonal) CRS. Consider, now, the following scenario: suppose the A-calculus is used 
to write a program M, and suppose that inputs to M form an infinite set of A-terms which can 
anyway be represented by a finite set of constructors in $. In this scenario, Proposition [T] allows 
to conclude the existence of finite subsets of E$ and lZ<s> such that every MN (where N £ 6) can 
be reduced via $ by using only constructors and rules in those finite subsets. As a consequence, 
we can see the above schema as one that puts any program M in correspondence to a finite CRS. 
Finally, observe that assuming data to be representable by a finite number of constructors in 
$ is reasonable. Scott's scheme [H], for example, allows to represent any term in a given free 
algebra in a finitary way, e.g. the natural number becomes [0] = c y! \z.z while n + 1 becomes 
\n+ 1] = c y ^ z , yx (\n\). Church's scheme, on the other hand, does not have this property. 

4 From Constructor Term Rewriting to Lamb da- Calculus 

In this Section, we will show that any rewriting step of a constructor rewrite system can be 
simulated by a fixed number of weak call-by- value beta-reductions. 

Let 3 be an orthogonal constructor rewrite system over a finite signature £a- Let Cx, . . . , c g be 
the constructors of 3 and let fx, . . . , f/, be the function symbols of 3. The following constructions 
work independently of 3. 

We will first concentrate on constructor terms, encoding them as A-terms using Scott's schema [T5] . 
Constructor terms can be easily put in correspondence with A-terms by way of a map ((-))a defined 
by induction as follows: 

((cj(tx • • •,*«))) A = Axx Xxg.Xy.Xi((ti))\ . . . ((*„))a- 

This way constructors become functions: 

((c 4 ))a = Axi \x ar ( Ct ).\yi Xy g .\z.y t xi . . . x ar(Ci) . 

Trivially, ((ci))\((ti))A . . . ((t n ))A rewrites to ((ci(ti . . . t n )))A in ar(cj) steps. To represent an error 

value, we use the A-term _L = \x\ Xx g .\y.y. A A-term built in this way, i.e. a A-term which 

is cither _L or in the form ((t})\ is denoted with metavariables like X or Y. 

The map ((-))a defines encodings of constructor terms. But what about terms containing 
function symbols? The goal is defining another map [-]a returning a A-term given any term t in 
T(3), in such a way that t — >* u and u £ C(5) implies [t]\ — ►* ((u))\. Moreover, [t]\ should rewrite 
to _L whenever the rewriting of t causes an error (i.e. whenever t has a normal form containing a 
function symbol). First of all, we can define the A-term [c,]a corresponding to any constructor 
Ci. To do that, define a A-term M % Xi x for every 1 < % < g, for every < m < or(cj) and for 
every variables x\, . . . , x m by induction on or(cj) — m: 

Xyi Xy g .yiXi . ..x ar{c .y, 

A//.//A I", . . . A'."',/."': 

where: 

Xz i Az^ (ej) .(Ai T ^i.A^ li ... iiBm+1 )M^2^ ) ; 

Xz m ..2 Xz ar 

Lemma 8 There is a constant n £N such that for every i and for every m: 

Mi u ... tXm {{{tx))K/xi, ((i m »A/Zm}((Wl»A • • • «tor(ci)»A ^ ((Ci(*X • ••*ar(c < ))»A 



Ml 



Vm : < m < ar(ci) 



Ml 



N m = 
LT = 
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(where k < n) and 

Mi lt ...,x m {((ti))A./Bi, ((t m })\/x m }X m+ i . ..X ar{Ci) _L 
(where I < n) whenever Xj is either a or _L but at least one among X m+ \ . . .X ar i Ci \ is _L. 

Proof. We proceed by induction on ar(cj) — to: 

• If to = ar(cj), then 

^....^.(^{(('iK/H. ■ • • . ({tar( Ci )))A/x ar{Ct) } 
= (A?/l XygViXi . . ■ X ar ( c . ) ){((t 1 )) A /x 1 , . . . , ((t ar ( Ci )))A/Xar(ci)} 

= Ayi Xy g .yi((ti))A ■ ■ ■ ((t a r( Ci )})A 

= ((Ci(ii, . . . ,t r(C())))A- 

• If to < ar(cj), we use the following abbreviations: 

P™ = i\^{«ti)>A/»i,...,«U>A/a: m }; 
Of = Lf{((t 1 )) A /x 1 ,...,((t m )) A /x m }. 

Let's distinguish two cases: 

• If X m+ i = L, then: 

M xi,...,x m {((*i))A/a;i, • • ■ , {{t m ))h/x m }X m+1 . ..X ar(Ct ) 

— >„ (-L-Pi™ • ■ • P™ i Q™)^m+2 • • • -X"ar(c<) 
— >* Q-"^m+2 • ■ • X ar ( Ci ) 

->* _L 

V 

• Let X m+ i be ((t rn+1 )) A , where i m+ i = Cj(ui, . . . , u ar{c .)). Then: 

M Kl ,...,x m {((*i)>A/a;i, • • ■ , ((t m ))A/x m }X m+1 . ..X ar{ci) 

—> v (((Cj(ui, . . . , U or ( C:( )))>A^l7i • ■ • ^™.Qr)^m+2 • • -^ar(Ci) 
"^J Pj!A u lflA ■ ■ ■ ((Uar( Cj ))) AX m+2 ■ ■ ■ X ar ( c .) 

(Ax m+1 .M^ ij ...^ +i {((t 1 )) A /x 1 , . . . , ((t m » A /z ro }) 

(^.....^(^{(("lWS/l, ■ • • , ((*or(c^)»A/yor(c i )})^m+2 • • • X ar(Cz) 

(Xx m+1 .M I Xu _ Xm+i {((t 1 )) A /x 1 , {{t m )) A /x m }) 

(((Cj(ui, . . . ,U ar ( c .)))) A )X m+2 . . .X ar ( c .) 
~**v M l 1 ,...,x m + 1 {i t l)) a/ x x , {{t m+1 )) A /x m+ i}X m+2 . ..X ar{ci) 

and, by the inductive hypothesis, the last term in the reduction sequence reduces to the 
correct normal form. The existence of a natural number n with the prescribed properties 
can be proved by observing that none of the reductions above have a length which depends 
on the parameters {(h)) A , {{t m )) A and X rn+1 . . . X ar{ci) . 
This concludes the proof. □ 

So, the required lambda term [ci\ A is simply M\. Interpreting function symbols is more difficult, 
since we have to "embed" the reduction rules into the A-term interpreting the function symbol. 
To do that, we need a preliminary result to encode pattern matching. 

Lemma 9 (Pattern matching) Let a>i,...,a n be non- overlapping sequences of patterns of the 
same length to. Then there are a term M™ Un and an integer I such that for every sequence of 
values V±, . . . , V n , if at — pi, . . . ,p m then 



m™, ..., an «Pi(*i, • • • ,^)»a • • • «p m (4, • • ■,t k -))) A v 1 ...v n 
v4t{)) A ...((t k s)) A ...((tl n )) A ...((tt))A, 
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where k < I, whenever the t\ are constructor terms. Moreover, 

M™_ an X u ...,X m V 1 ...V n ^ k v ±, 

where k < I, whenever X\, . . . , X m do not unify with any of the sequences ct\, . . . , a n or any of 
the Xi, . . . , X m is itself J- . 

Proof. We go by induction on p — Y^i=i \\ a i\\> where ||ai|| is the number of constructors occur- 
rences in patterns inside a*: 

• If p = and n = 0, then we should always return _L: 

M £ m eee Axi Xx m .±. 

• If p = and n = 1 and oti is simply a sequence of variables Xi, . . . , x m (because the oti are 
assuming to be non-overlapping). Then M™ x is a term defined by induction on m which 
returns _L only if one of its first m arguments is _L and otherwise returns its m + 1-th argument 
applied to its first m arguments. 

• If P > 1 , then there must be integers i and j with 1 < i < m and 1 < j < n such that 

etj = pi, . ,pj_i,Cfe(ri, . . . , r ar ( Cfc )),p i+ i, . . . ,p m 

for a constructor and for some patterns p p and some r q . Now, for every 1 < p < n and for 
every 1 < j < g we define sequences of patterns f3 3 p and values as follows: 

• If 

a p = pi,.. . , Pi_i,Cj(ri, . . . ,r ar ( C3 )),pj + i . . .p m 
then (3 3 p is defined to be the sequence 

Pi, • • • ,Pi-i,ri, . . . , r ar ( Cfc ),Pi+i, . . . ,p m . 

Moreover, W p is simply the indentity Xx.x. 

• If 

a p = pi,. .. , Pi_i,c s (ri, . . . , r ar ( Cs )),p i+ i . . . p m 

where s ^ j then f3 3 p and W 3 are both undefined. 

• Finally, if 

a p = pi, . . . ,pi-i,x,p i+1 ...p m 
then (3 3 p is defined to be the sequence 

Pi , • ■ • , Pi— 1) 2-1 1 • ■ ■ i 2 ; ar(cj ) ? Pi+1 ; ■ ■ ■ j Pm- 

and W 3 is the following A-term 

Xx.Xyx Xyt-X! Ax ar(Cfc) .Azi Xz u .xy x . . . y t (((cj)) A Xi . . . x ar(cj) )zi ...z u 

where t is the number of variables in pi, . . . ,Pi-i and u is the number of variables in 

Pi+1> • ■ • j Pm- 

As a consequence, for every 1 < j < g, we can find a natural number tj and a sequence of 
pairwise distinct natural numbers i\, . . . , i tj such that [3^ , . . . , [3j t are exactly the sequences 

which can be defined by the above construction. We are now able to formally define M™ ... „ ; 
it is the term 

Xxi Xx m .Xy! Xy n .((xiVi . . . V g V±)xi . . . Xi-\x i+ \ . . . x m )yi ...y n 

where 

Vl<j<g-Vj = Xzi Xz ar ( Cj ).Xxi Xxi^x.Xx i+1 Xxm.Xyx Xy n . 

M 7 j ~ 1 ^ (C3)xi ' ' ' Xl ~ lZl ' ' ' Zar ^) Xi + 1 ■ ■ ■ x ™( w ly*i) ■ ■ ■ ( w i tj ) 

»1 ' Hj 

V± = Xxi Xxi^i.Xx i+ i Xx m .Xyi Ay„._L 

Notice that, for every j, p > Y^v=i WPiW- Moreover, for every j any (3 3 v has the same length 
m — 1 + ar(cj). This justifies the application of the induction hypothesis above. 
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This concludes the proof. □ 
For every function symbol fi, let 

be the rules for f^. Moreover, suppose that the variables appearing in the patterns in oq are 
zf' , . . . , zl'™ 1 ' 3 . Recall that we have a signature with fi, . . . , f/j function symbols. For any 1 < i < h 
the lambda term interepreting is defined to be: 

[fj] A = ...V h 

where 

Vi = Xx! Xxh.Xy! Xy ar( t i ).M a i^^ ot ny 1 ...y ar ^Wl ...W?* 

wi = \zi xz mi:j .i\iu 

whenever 1 < i < h and 1 < j < n^. Moreover (| • Da is defined by induction as follows: 

t\x\/ A = x 

<|Ci(ii, . . . ,t ar ( Ci ))\)A = [Ci]a{|*iDa ■ ■ • ^ ar ( Ci )\)A 
§fi(ti, . . . ,t r(fi))DA = ^id^lDA • • • ^or(f<)DA 

Now, we have all the necessary ingredients to extend the mapping [-]a to every term in T(S): 

[c(tl, . . . ,t ar ( c .))] A = [Ci] A [il]A ■ ■ ■ [*or(ci)]A 
[fj(il, . . . ,t a r(f«))]A = [fi]A[£l]A--- [*or(f«)]A 

Theorem 2 There is a natural number k such that for every function symbol f and for every 
ti, ■ ■ ■ , t ar (f) £ the following three implications hold (where u stands for f{t\, . . . , i a r(f)) ond 

M stands for [%(<*!.» A • ■ ■ ((t ar (f)))A): 

• If u rewrites to v £ C(H) in n steps, then M rewrites to ((v))a in at most kn steps. 

• If u rewrites to a normal form v ^ C(S), then M rewrites to _L. 

• If u diverges, then M diverges. 

Proof. By an easy combinatorial argument following from the definition of [-]a- d 
Clearly, the constant k in Theorem [2] depends on S, but is independent on the particular term u. 

5 Graph Representation 

The previous two sections proved the main simulation result of the paper. To complete the picture, 
we show in this section that the unitary cost model for the (weak call-by-value) A-calculus (and 
hence the number of rewriting in a constructor term rewriting system) is polynomially related to 
the actual cost of implementing those reduction^. We do so by introducing term graph rewriting, 
following pQ but adapting the framework to call-by-value constructor rewriting. Contrarily to 
what we did in Section [31 we will stay abstract here: our attention will not be restricted to the 
particular graph rewrite system that is needed to implement reduction in the A-calculus. 

We refer the reader to our [5] for more details on efficient simulations between term graph 
rewriting and constructor term rewriting, both under innermost (i.e., call- by- value) and outermost 
(i.e., call-by-name) reduction strategies. 

2 As mentioned in the introduction, see |16| for another proof of this with other means. 
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Definition 4 (Labelled Graph) Given a signature S, a labelled graph over S consists of a 
directed acyclic graph together with an ordering on the outgoing edges of each node and a (partial) 
labelling of nodes with symbols from X such that the out- degree of each node matches the arity of 
the corresponding symbols (and is if the labelling is undefined). Formally, a labelled graph is a 
triple G = (V, a, S) where: 

• V is a set of vertices. 

• a : V — > V* is a (total) ordering function. 

• 5 : V — ^ V is a (partial) labelling function such that the length of a(v) is the arity of S(v) if 
S(v) is defined and is otherwise. 

A labelled graph (V, a, 6) is closed iff S is a total function. 

Consider the signature S = {a, b, c, d}, where arities of a,b,c,d are 2, 1, 0, 2 respectively, and b, 
c, d are constructors. Examples of labelled graphs over the signature X are the following ones: 



n 

b 

i 

d 



_L 




b 

1 

_L 

The symbol _L denotes vertices where the underlying labelling function is undefined (and, as a 
consequence, no edge departs from such vertices). Their role is similar to the one of variables in 
terms. 

If one of the vertices of a labelled graph is selected as the root, we obtain a term graph: 

Definition 5 (Term Graphs) A term graph, is a quadruple G = (V, a, 8, r), where (V,a, S) is 
a labelled graph and r G V is the root of the term graph. 

The following are graphic representations of some term graphs. 



I 



_L 




b c 

1 

1 

The root is the only vertex drawn inside a circle. 

There are some classes of paths which are particularly relevant for our purposes 

Definition 6 (Paths) A path v\, . . . ,v n in a labelled graph G — (V, a, S) is said to be: 

• A constructor path iff for every 1 < i < n, the symbol S(vi) is a constructor; 

• A pattern path iff for every 1 < i < n, 8{vi) is either a constructor symbol or is undefined; 

• A left path iff n > 1, the symbol S(vi) is a function symbol and V2, ■ ■ ■ , v n is a pattern path. 

Definition 7 (Homomorphisms) An homomorphism between two labelled graphs G — (Vq, cxq, o~g) 
and H = {Vh,cxh,Sh) over the same signature X is a function ip from Vq to Vh preserving the 
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term graph structure. In particular 

6 H (if(v)) = S G (v) 
a H (ip(v)) = ip*(a G (v)) 

for any v 6 dom(5), where ip* is the obvious generalization of ip to sequences of vertices. An 
homomorphism between two term graphs G — (Vq, etc, 5g, r o) and H — (Vh, (xh, o~h, th) is an 
homomorphism between (V G , etc, S G ) and (Vh, olh, Sh) such that <p(rG) = ru- Two labelled graphs 
G and H are isomorphic iff there is a bijective homomorphism from G to H ; in this case, we write 
G = H. Similarly for term graphs. 

In the following, we will consider term graphs modulo isomorphism, i.e., G = H iff G = H. 
Observe that two isomorphic term graphs have the same graphical representation. 

Definition 8 (Graph Rewrite Rules) A graph rewrite rule over a signature £ is a triple p = 
(G,r, s) such that: 

• G is a labelled graph; 

• r,s are vertices of G, called the left root and the right root of p, respectively. 

• Any path starting in r is a left path. 

The following are examples of graph rewriting rules, assuming a to be a function symbol and b, c, d 
to be constructors: 

n 

b x ± 

I 

d 

/ \ 

b c 

X 

_L 



Definition 9 (Subgraphs) Given a labelled graph G = (Vg, etc, 3g) and any vertex v £ Vq, the 
subgraph of G rooted at v, denoted G J, v, is the term graph {Vqiv, ccgiv, ^Giv,rciv) where 

• Voir is the subset of Vg whose elements are vertices which are reachable from v in G. 

• aciv and Sqiv are the appropriate restrictions of ctQ and 5q to Vg[v 

• r Giv is v. 

Definition 10 (Redexes) Given a labelled graph G, a redex for G is a pair (p, ip), where p is a 
rewrite rule (H, r, s) and ip is an homomorphism between H { r and G such that for any vertex 
v e Vnir with v dom(8Hir), any path starting in p(v) is a constructor path. 

The last condition in the definition of a redex is needed to capture the call-by-value nature of the 
rewriting process. 

Given a term graph G and a redex ((H, r, s), ip), the result of firing the redex is another term 
graph obtained by successively applying the following three steps to G: 

1. The build phase: create an isomorphic copy of the portion of H J, s not contained in H J, r, 
and add it to G, obtaining J. The underlying ordering and labelling functions are defined 
in the natural way. 

2. The redirection phase: all edges in J pointing to <p(r) are replaced by edges pointing to the 
copy of s. If ip(r) is the root of G, then the root of the newly created graph will be the newly 
created copy of s. The graph K is obtained. 
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3. The garbage collection phase: all vertices which are not accessible from the root of K are 
removed. The graph I is obtained. 

(H,r,s) 

We will write G — A I (or simply G — > 7, if this does not cause ambiguity) in this case. 
As an example, consider the term graph G and the rewriting rule p = (H, r, s): 




There is an homomorphism ip from H [ r to G. In particular, ip maps r to the rightmost vertex 
in G. Applying the build phase and the redirection phase we get J and K as follows: 



b @ *b 




J K 
Finally, applying the garbage collection phase, we get the result of firing the redex (p, <p): 




b c 



Definition 11 A constructor graph rewrite system (CGRS) over a signature £ consists of a set 
of graph rewrite rules Q onT,. 

5.1 From Term Rewriting to Graph Rewriting 

Any term t over the signature E can be turned into a graph G in the obvious way: G will be a tree 
and vertices in G will be in one-to-one correspondence with symbol occurrences in t. Conversely, 
any term graph G over E can be turned into a term t over E (remember: we only consider acyclic 
graphs here). Similarly, any term rewrite rule t —> u over the signature E can be translated into 
a graph rewrite rule (G, r, s) as follows: 

• Take the graph representing t and u. They are trees, in fact. 

• From the union of these two trees, share those nodes representing the same variable in t and 
u. This is G. 

• Take r to be the root of t in G and s to be the root of u in G. 
As an example, consider the rewriting rule 

a(b(x),y) -> b(a(y,a(y,x))). 
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Its translation as a graph rewrite rule is the following: 




b L^a 




An arbitrary constructor rewriting system can be turned into a constructor graph rewriting 
system: 

Definition 12 Given a constructor rewriting system 1Z over £, the corresponding constructor 
graph rewriting system Q is defined as the class of graph rewrite rules corresponding to those in 
1Z. Given a term t, [t]g will be the corresponding graph, while the term graph G corresponds to 
the term (G)n- 

Let us now consider graph rewrite rules corresponding to rewrite rules in 1Z. It is easy to realize 
that the following invariant is preserved while performing rewriting in [TZ]g: whenever any vertex 
v can be reached by two distinct paths starting at the root (i.e., v is shared), any path starting at 
v is a constructor path. A term graph satisfying this invariant is said to be constructor- shared. 

Constructor-sharedness holds for term graphs coming from terms and is preserved by graph 
rewriting: 

Lemma 10 For every closed term t, [t]g is constructor-shared. Moreover, if G is closed and 
constructor-shared and G —* I, then I is constructor-shared. 

Proof. The fact [t]g is constructor-shared for every t follows from the way the [-]g map is defined: 
it does not introduce any sharing. Now, suppose G is constructor-shared and 

G (H ^> ] I 

where (H, r, s) corresponds to a term rewrite rule t — > u. The term graph J obtained from G 
by the build phase is itself constructor-shared: it is obtained from G by adding some new nodes, 
namely an isomorphic copy of the portion of H j s not contained in H J, r. Notice that J is 
constructor-shared in a stronger sense: any vertex which can be reached from the newly created 
copy of s by two distinct paths must be a constructor path. This is a consequence of (H, r, s) 
being a graph rewrite rule corresponding to a term rewrite rule t — > u, where the only shared 
vertices are those where the labelling function is undefined. The redirection phase preserves itself 
constructor-sharedness, because only one pointer is redirected (the vertex is labelled by a function 
symbol) and the destination of this redirection is a vertex (the newly created copy of s) which had 
no edge incident to it. Clearly, the garbage collection phase preserve constructor-sharedness. □ 

Lemma 11 A closed term graph G in Q is a normal form iff (G)-jz is a normal form. 

Proof. Clearly, if a closed term graph G is in normal form, then (G)n is a term in normal form, 
because each redex in G translates to a redex in (G)n- On the other hand, if {G)n is m normal 
form, then G is in normal form: each redex in (G)n translates back to a redex in G. □ 

Reduction at the level of graphs correctly simulates reduction at the level of terms, but only if the 
underlying graphs are constructor shared: 

Lemma 12 If G is closed and constructor-shared and G — ► /, then (G)n — » {I)n- 

Proof. The fact each reduction step starting in G can be mimicked by n reduction steps in (G)jz 
is known from the literature. If G is constructor-shared, then n = 1, because any redex in a 
constructor-shared term graph cannot be shared. □ 
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A counterexample, when G in not constructor-shared can be easily built: consider the term rewrite 
rule <z(c, c) — > c and the following term graph, which is not constructor-shared and correspond to 
a(a(c, c), a(c. c)): 




c c 



The term graph rewrites in one step to the following one 




c 



while the term a(a(c, c) , a(c, c)) rewrites to a(c, c) in £wo steps. 

As can be expected, graph reduction is even complete with respect to term reduction, with the 
only proviso that term graphs must be constructor-shared: 

Lemma 13 If t — > u, G is constructor- shared and {G)n — t, then G — > I, where (I)n = u. 

Theorem 3 (Graph Reducibility) For every constructor rewrite system 72. over S and for 
every term t over S, the following two conditions are equivalent: 

1. t — y n u, where u is in normal form; 

2. [t]g — > n G, where G is in normal form and (G)-jz = u. 

Proof. Suppose t — > n u, where u is in normal form. Then, by applying Lemma 1131 we obtain 
a term graph G such that [t]g — >™ G and (G)-jz = u. By Lemma [TU1 G is canonical and, by 
Lemma HU it is in normal form. Now, suppose [t]g — >™ G where (G)-ji = u and G is in normal 
form. By applying n times LemmaQSl we obtain that {{t]g)n — > " (G)n — u - But {{t]g)n = t and 
u is a normal form by Lemma [TT1 since [t]g and G are constructor shared due to Lemma fTTJl □ 

There are term rewrite systems which are not graph reducible, i.e. for which the two conditions 
of Theorem [3] are not equivalent (see [1]). However, any othogonal constructor rewrite system is 
graph reducible, due to the strict constraints on the shape of rewrite rules [15j . This result can 
be considered as a by-product of our analysis, for which graph rewriting is only instrumental. 

5.2 Lambda- Terms Can Be Efficiently Reduced by Graph Rewriting 

As a corollary of Theorcm[3]and Theorem[T] we obtain the possibility of reducing A-terms by term 
graphs over £$. To this purpose, we can use the CGRS corresponding to <£>: 

Corollary 1 Let M S A be a closed term. The following two conditions are equivalent: 

1. M — >™ N where N is in normal form; 

2. [[M]$]e —> n G where (((G)$))a = N and G is in normal form. 

However, there are some missing tales. Let us analyze more closely the combinatorics of graph 
rewriting in 0: 

• Consider a closed A-term M and a term graph G such that [[M]$]e G. By Proposition Q] 
and Lemma fT21 for every constructor c X) at appearing as a label of a vertex in G, N is a subterm 
of M. 

• As a consequence, if [[M]$]e — >* G — > H, then the difference \H\ — \G\ cannot be too big: at 
most \M\. As a consequence, if [[M]$]e — G then |G| < (n + 1)|M|. Here, we exploit in an 
essential way the possibility of sharing constructors. 

• Whenever [[M]$]e — * n G, computing a graph H such that G — ► H takes polynomial time in 
|G|, which is itself polynomially bounded by n and \M\. 
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Hence: 

Theorem 4 There is a polynomial p : N 2 — > N such that for every X-term M , the normal form 
of [[Af]$]e can be computed in time at most p(\M\, Time v (M)). 

As we mentioned in the introduction, this cannot be achieved when using explicit representations 
of A-terms. Moreover, reading back a A-term from a term graph can take exponential time, as we 
mentioned in the introduction. 

We can complement Theorem with a completeness statement — any universal computational 
model with an invariant cost model can be embedded in the A-calculus with a polynomial overhead. 
We can exploit for this the analogous result we proved in [4] (Theorem 1) — the unitary cost model 
is easily proved to be more parsimonious than the difference cost model considered in 0]. 

Theorem 5 Let f : E* — » E* be computed by a Turing machine M. in time g. Then, there are 
a X-term Nm and a suitable encoding r - n : E* — > A such that Nj^ r v n normalizes to r f(v) n in 
0(g(\v\)) beta steps. 



6 Variations: Call-by-Name Reduction 

Our purpose in this last section is showing that similar techniques can be applied to call-by-name 
evaluation of A-terms. 

In the previous sections, A-calculus was endowed with weak call-by-value reduction. The same 
technique, however, can be applied to weak call-by-name reduction, as we will sketch in this 
section. A is now endowed with a relation — >/j defined as follows: 

M -f h N 



{Xx.M)N -> h M{N/x} ML -> h NL 

Similarly to the call-by-value case, Timeh(M) stands for the number of reduction steps to the 
normal form of M (if any). Since the relation — ^ is deterministic (i.e., functional), Timeh(M) is 
well-defined. 

We need another CRS, called \&, which is similar to $ but designed to simulate weak call- by- 
name reduction: 

• The signature E* includes the binary function symbol app and constructor symbols c x> m for 
every Me A and every x £ T, exactly as E$. Moreover, there is another binary constructor 
symbol capp. To every term Me Awe can associate terms {M}*, [M]<p G V(^, T) as follows: 

{x}qi = x 

{Xx.M}^ = c x> m (xi, . . . , x n ), where FV(Ax.M) = x\ , . . . , x n 
{AINU = capp({M}*, {AT}*) 

[x] q, = X 

[\x.M]\& = c x> u (a?i, . . . , x n ), where FV(Ax.M) = x\ , . . . , x n 
[MN]* = app([A-f]^,{A^}^) 

Notice that {•}* maps lambda terms to constructor terms, while terms obtained via can 
contain function symbols. 

• The rewrite rules in 72.* are all the rules in the following form: 

app(c^ z ,capp(u;, /)) -> app(w,f) 

app(c ZlZ ,c XtM (xi,...,x n )) -> c XiM (xi,...,x n ) 
app(c z , w (capp(/,5f)),/i) app(/,.g) 

app(c z ,„,(ca;,Af(a;i,-.-,x n )),/i) -> c x , M (xi, ■ ■ ■ ,x n ) 
app(c VtN (yi,...,y m ),y) [iV]* 
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where M ranges over A-terms, N ranges over abstractions and applications, FV(Aa;.M) = 
Xi, . . . , x n and FV(Ay.iV) = t/i> . . . , y m . These rewrite rules are said to be ordinary rules. We 
also need the following administrative rule: 

app(capp(a;, y),z)^> app(app(x, y), z) 

The CTRS ^ is slightly more complicated than <i>: some additional overhead is needed to force 
reduction to happen only in head position. As usual, to every term t € V(\&, T) we can associate 
a term (t) A : 

(x) A = x 
(app(u,w)) A = (capp(u,w)} A = (u) A (v) A 

(c x .M(t 1 ,...t n )) A = (\x.M){(t 1 ) A /xi,...,(t n ) A /x n \ 

where FV(Xx.M) = x%, . . . , x n . A term t £ T('i') is canonical if either t = c x ^i{t\ ■ ■ ■ ,t n ) € C(^) 
or t — app(u, v) where u is canonical and v S C(\&). 

Lemma 14 For every closed M 6 A, [M]^ is canonical. 

Proof. By a straightforward induction on M. □ 

The obvious variation on Equation [1] holds here: 

{[M^ih/xt, t n /x n }) A = M{(t 1 ) A /x ll (t n ) A /x n }. (2) 

^ mimics call-by-name reduction in much the same way $ mimics call-by-value reduction. How- 
ever, one reduction step in the A-calculus corresponds to n > 1 steps in ^E", although n is kept 
under control: 

Lemma 15 Suppose that t G T(^) is canonical and that t — > u. Then there is a natural number 
n such that: 

1- (t) A ->h (u) A ; 

2. There is a canonical term v £ such that u — > n v; 

3- |w| ap p = Mapp + rn whenever u —> m w and m < n; 

4- {w) A — (u) A whenever u w and m < n. 

Proof. A term t is said to be semi-canonical iff t = app(u,w), where v € C('I') and u is cither 
semi-canonical or is itself an element of C(^>). We now prove that if t is semi-canonical, there 
there are a natural number n and a canonical term u such that: 

• t u; 

• Mapp = KUpp + "rn whenever t — > m v and m < n; 

• { v )a = {t) A whenever t — > m v and m < n. 

We can proceed by induction on \t\. By definition t is always in the form app(w, d). We distinguish 
three cases: 

• w is semi-canonical. Then, we get what we want by induction hypothesis. 

• w is in C(^E') and has the form c^a/^i, ■ • • , t m ). Then, n = and t is itself canonical. 

• w is in C(^) and has the form capp(e, /). Then 

t = app(capp(e, /), d) app(app(e, /), d) 

We can apply the induction hypothesis to app(e, /) (since its length is strictly smaller than 
1*1). 

We can now proceed as in Lemma |6l since whenever t rewrites to u by one of the ordinary rules, 
u is semi-canonical. □ 

Lemma 16 A canonical term t € T($>) is in normal form iff (t) A is in normal form. 
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Proof. Wc first prove that any canonical normal form t can be written as c Xj M(ti, ■ ■ ■ > tn), where 
ti, . . . ,t n € C(\P). We proceed by induction on t: 

• If t = c Xt M(ti, ...,t n ), then the thesis holds. 

• If t = app(u, v), then u is canonical and in normal form, hence in the form c^jvf (ti, . . . , t n ) by 
induction hypothesis. As a consequence, t is not a normal form, which is a contraddiction. 

We can now prove the statement of the lemma, by distinguishing two cases: 

• If t = c X) m(*i, • • • j t n ), where t±,...,t n G C(^), then t is in normal form and (t)\ is an 
abstraction, hence a normal form. 

• If t = app(w, v), then t cannot be a normal form, since u is canonical and in normal form and, 
as a consequence, it can be written as c x< m (t\, . . . , t n ). 

This concludes the proof. □ 

Observe that this property holds only if t is canonical: a non-canonical term can reduce to another 
one (canonical or not) even if the underlying A-term is a normal form. 

Lemma 17 If M — >/, N , t is canonical and (t)\ = M , then t — ► u, where (u)\ = N and 

|^|app 1 ^ |^|app- 

Proof. Similar to the one of Lemma Q~7] □ 

The slight mismatch between call-by-name reduction in A and reduction in \t is anyway harmless 
globally: the total number of reduction step in 4" is at most two times as large as the total number 
of call- by-name reduction steps in A. 

Theorem 6 (Term Reducibility) Let M 6 A be a closed term. The following two conditions 
are equivalent: 

1. M —>-^ N where N is in normal form; 

2. [M]* — * m t where ((f)) a = N and t is in normal form. 
Moreover n < m < 2n. 

Proof. Suppose M — N, where N is in normal form. M is closed and, by Lemma [HJ [M]y is 
canonical. By iterating over Lemma [TBI and Lemma fTTl we obtain the existence of a term t such 
that (t)\ = u, t is in normal form and [M]-$ —> m t, where m > n and 

|£|ap P - |[Af]*| app >(m-n)—n. 

Since |i| app = (t is in normal form), m < 2n. If [M]-q> t where ((t))\ = N and t is in normal 
form, then by iterating over Lemma fT5l wc obtain that M — >^ N where n < m < In and N is in 
normal form. □ 

S is the graph rewrite system corresponding to in the sense of Section [K] Exactly as for the 
call-by-value case, computing the normal form of (the graph representation of) any term takes 
time polynomial in the number of reduction steps to normal form: 

Theorem 7 There is a polynomial p : N 2 — > N such that for every X-term M , the normal form 
of [[M]-q,]s can be computed in time at most p(\M\, Timeh(M)). 

On the other hand, we cannot hope to directly reuse the results in Section [4] when proving the 
existence of an embedding of CRSs into weak call-by-name A-calculus: the same A-term can have 
distinct normal forms in the two cases. It is widely known, however, that a continuation-passing 
translation can be used to simulate call-by- value reduction by call-by-name reduction [14] . The 
only missing tale is about the relative performances: do terms obtained via the CPS translation 
reduce (in call-by-name) to their normal forms in a number of steps which is comparable to the 
number of (call-by-value) steps to normal form for the original terms? We conjecture the answer 
is "yes" , but we leave the task of proving that to a future work. 
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7 Conclusions 



We have shown that the most naive cost models for weak call-by- value and call-by-name A-calculus 
(each beta-reduction step has unitary cost) and orthogonal constructor term rewriting (each rule 
application has unitary cost) are linearly related. Since, in turn, this cost model for A-calculus 
is polynomially related to the actual cost of reducing a A-term on a Turing machine, the two 
machine models we considered are both reasonable machines, when endowed with their natural, 
intrinsic cost models (see also Gurevich's opus on Abstract State Machine simulation "at the 
same level of abstraction", e.g. [8]). This strong (the embeddings we consider are compositional), 
complexity-preserving equivalence between a first-order and a higher-order model is the most 
important technical result of the paper. 

Ongoing and future work includes the investigation of how much of this simulation could be 
recovered either in a typed setting (see [T7] for some of the difficulties), or in the case of A- 
calculus with strong reduction, where we reduce under an abstraction. Novel techniques have to 
be developed, since the analysis we performed in the present paper cannot be easily extended to 
these cases. 

Acknowledgments 

The authors wish to thank Kazushige Terui for stimulating discussions on the topics of this paper. 

References 

[1] H. Barendregt, M. Eekelen, J. Glauert, J. Kennaway, M. Plasmeijer, and M. Sleep. Term 
graph rewriting. In J. de Bakker, A. Nijman, and P. Treleaven, editors, Volume II: Parallel 
Languages on PARLE: Parallel Architectures and Languages Europe, pages 141-158. Springer- 
Verlag, 1986. 

[2] Erik Barendsen. Term graph rewriting. In Terese (M. Bezem, J.W. Klop, and R. de Vrijer), 
editors, Term Rewriting Systems, chapter 13, pages 712-743. Cambridge Univ. Press, 2003. 

[3] Stephen Bellantoni and Stephen Cook. A new recursion-theoretic characterization of the 
polytime functions. Computational Complexity, 2:97-110, 1992. 

[4] Ugo Dal Lago and Simone Martini. An invariant cost model for the lambda-calculus. In 
Computability in Europe 2006, volume 3988 of LNCS, pages 105-114. Springer, 2006. 

[5] Ugo Dal Lago and Simone Martini. Derivational complexity is an invariant cost model. Int. 
Work, on Foundational and Practical Aspects of Resource Analysis, FOPARA, Eindhoven, 
2009. 

[6] Ugo Dal Lago and Simone Martini. On constructor rewrite systems and the lambda-calculus. 
In ICALP 2009, Part II, volume 5556 of LNCS, pages 163-174. Springer, 2009. 

[7] J.-Y. Girard. Light linear logic. Inform, and Comp., 143(2):175-204, 1998. 

[8] Yuri Gurevich. The sequential ASM thesis. In Current trends in theoretical computer science, 
pages 363-392. World Scientific, 2001. 

[9] Simon Peyton Jones. The Implementation of Functional Programming Languages. Prentice 
Hall, 1987. 

[10] Daniel Lcivant. Ramified recurrence and computational complexity I: word recurrence and 
poly-time. In Feasible Mathematics II, pages 320-343. Birkhauser, 1995. 

[11] Jean- Yves Marion and Jean- Yves Moyen. Efficient first order functional program interpreter 
with time bound certifications. In Logic for Programming and Automated Reasoning, 7th 
International Conference, Proceedings, volume 1955 of LNCS, pages 25-42. Springer, 2000. 



20 



[12] Michel Parigot. On the representation of data in lambda-calculus. In 3rd Workshop on 
Computer Science Logic, Proceedings, volume 440 of LNCS, pages 309-321. Springer, 1990. 

[13] Michel Parigot and Paul Roziere. Constant time reductions in lambda-caculus. In Mathe- 
matical Foundations of Computer Science 1993, 18th International Symposium, Proceedings, 
volume 711 of LNCS, pages 608-617. Springer, 1993. 

[14] Gordon D. Plotkin. Call-by-name, call-by-value and the lambda-calculus. Theoretical Com- 
puter Science, 1(2):125-159, 1975. 

[15] Dctlcf Plump. Graph-reducible term rewriting systems. In Graph- Grammars and Their 
Application to Computer Science, pages 622-636, 1990. 

[16] D. Sands, J. Gustavsson, and A. Moran. Lambda calculi and linear speedups. In The Essence 
of Computation: Complexity, Analysis, Transformation. Essays Dedicated to Neil D. Jones, 
number 2566 in LNCS, pages 60-82. Springer Verlag, 2002. 

[17] Zdzislaw Splawski and Pawel Urzyczyn. Type fixpoints: Iteration vs. recursion. In J^th 
International Conference on Functional Programming, Proceedings, pages 102-113. ACM, 
1999. 

[18] Peter van Emde Boas. Machine models and simulation. In Handbook of Theoretical Computer 
Science, Volume A: Algorithms and Complexity (A), pages 1-66. MIT Press, 1990. 

[19] Christopher Wadsworth. Some unusual A-calculus numeral systems. In J. P. Seldin and J.R. 
Hindley, editors, To H.B. Curry: Essays on Combinatory Logic, Lambda Calculus and For- 
malism. Academic Press, 1980. 



21 



